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METHODS, SYSTEMS AND COMPUTER PROGRAM PRODUCTS FOR 
FILTERING CONTENT DISPLAYED IN A TREE MAP VISUALIZATION 

FIELD OF THE INVENTION 

The present invention relates generally to displaying data on a display 
device. More particularly, the present invention relates to controlling the display 
of data that is displayed on "tree map" visualizations. 



BACKGROUND OF THE INVENTION 

As computer technology advances, computing systems have undertaken the 
management and processing of larger data systems. With data systems ranging 
from massive standalone databases to vast distributed networks, oftentimes the 

20 limiting factor in analyzing the state of a given system rests not with computing 
resources, but with the human operator. Specifically, though the computing 
system may aggregate vast quantities of data in near real-time, in many cases, a 
human being must visualize the compilation of data to draw effective conclusions 
from the visualization. Yet, the ability of the end user to digest compiled 

25 information varies inversely with the amount of data presented to the end user. 

Where the amount of compiled data becomes excessive, it can be nearly impossible 
for a human being to adequately analyze the data. 

In an effort to address the foregoing difficulties, tree-map visualization 
methods have been developed. Initially proposed by Brian Johnson and Ben 

30 Shneiderman in the paper, Johnson et aL, Tree-Maps: A Space-Filling Approach to 
the Visualization of Hierarchical Information Structures, Dept. of Computer 
Science & Human-Interaction Laboratory (University of Maryland June 1991), 
tree-map visualization techniques map "hierarchical information to a rectangular 2- 
D display in a space-filling manner" in which the entirety of a designated display 
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space is utilized. Additionally, "[interactive control allows users to specify the 
presentation of both structural (depth bounds, etc.) and content (display properties 
such as color mappings) information." Tree-map visualization techniques can be 
compared in a contrasting manner to traditional static methods of displaying 
5 hierarchically structured information. 

According to conventional static methods, a substantial portion of 
hierarchical information can be hidden from user view to accommodate the view of 
the hierarchy itself. Alternatively, the entire hierarchy can be visually represented, 
albeit vast amounts of display space can be obscured, hence wasted, simply to 

1 0 accommodate the structure without regard to the hierarchical data in the hierarchy 
itself. In the tree-map visualization technique, however, sections of the hierarchy 
containing more important information can be allocated more display space while 
portions of the hierarchy which are deemed less important to the specific task at 
hand can be allocated less space. More particularly, in operation, tree-maps 

1 5 partition the display space into a collection of rectangular bounding boxes 

representing the tree structure. The drawing of nodes within the bounding boxes 
can be entirely dependent on the content of the nodes, and can be interactively 
controlled. Since the display space size is user controlled, the drawing size of each 
node varies inversely with the size of the tree, for instance the number of nodes. 

20 Thus, trees having many nodes can be displayed and manipulated in a fixed display 
space, yet still be visible even when dealing with 1 million objects. 

Figure 1 illustrates a conventional tree map display 10. As seen in Figure 
1, a 10 by 10 display grid is filled with bounding boxes 12 through 68 representing 
the display of a data set containing twenty-nine entries. Data values associated 

25 with the twenty-nine entries establish the size of the bounding boxes and the color 
of the box, as represented by the different cross-hatch patterns illustrated in Figure 
1. Thus, a first data value may establish the size of the bounding box, for example, 
market capitalization if the data set represents different stocks, and a second data 
value may establish the color of the bounding box, for example, the change in 

30 stock price. Thus, in the example illustrated in Figure 1, the tree map display 10 is 
created from the data set of Table 1 below. 

Table 1 . Exemplary Data 
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A further example of the use of a tree map visualization is provided by 
Fidelity Investments' map of the stock market (which may be found at 
activequote.fidelity.com/rtrnews/market_map.phtml). In the Fidelity market map, 
the market is divided into sectors and the sectors are populated with bounding 
5 boxes for individual stocks. The size of the bounding boxes is based on the market 
capitalization of the stock and the color of the boxes are based on the price activity 
of the stock. 



SUMMARY OF THE INVENTION 

10 Embodiments of the present invention provide for displaying data from a 

first data set utilized in generating a first tree map visualization. The first data set 
is filtered so as to provide a second data set having a reduced amount of data 
relative to the first data set. A second tree map visualization is generated based on 
the second data set. 

15 In particular embodiments of the present invention, the second tree map 

visualization is generated based on the second data set so as to increase the size of 
bounding boxes associated with data common to both the first and the second data 
sets as compared to the first tree map visualization based on the first data set. The 
second tree map visualization may also be generated based on the second data set 

20 so as to decrease utilization of the processor in generating the tree map 

visualization based on the second data set as compared to processor utilization in 
generating the first tree map visualization based on the first data set. 

In still further embodiments of the present invention, the first data set is 
filtered based on at least one of data values of data elements of the first data set 

25 utilized in generating the first tree map, data values of data elements of the first 
data set that are not utilized in generating the first tree map and/or metadata 
associated with the data elements of the first data set. 

In additional embodiments of the present invention, filtering the first data 
set so as to provide a second data set having a reduced amount of data relative to 

30 the first data set is provided by filtering the first data set based on at least one 
threshold value of data utilized in generating the tree map visualization. 
Additionally, the threshold value may be set so as to provide a tree map 
visualization having a predefined minimum bounding box area. The predefined 
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minimum bounding box area may be greater than a minimum area for a bounding 
box that a display device is capable of displaying. The threshold value may also be 
set by evaluating data in the first data set to establish the threshold value so as to 
provide a tree map visualization having the predefined minimum bounding box 
5 area 

In yet other embodiments of the present invention, filtering the first data set 
so as to provide a second data set having a reduced amount of data relative to the 
first data set is provided by filtering the first data set such that the second data set 
provides a tree map visualization with a predefined characteristic. 

10 In additional embodiments of the present invention, a display is generated 

based on a third data set containing data filtered from the first data set to provide 
the third data set. The generated display may be a second tree map visualization. 

As will further be appreciated by those of skill in the art, while described 
above primarily with reference to method aspects, the present invention may be 

15 embodied as methods, apparatus/systems and/or computer program products. 

BRIEF DESCRIPTION OF THE FIGURES 
Figure 1 is an illustration of the display of a set of data utilizing a 
conventional tree map. 

Figure 2 is a block diagram of a data processing system suitable for use in 
20 embodiments of the present invention. 

Figure 3 is a more detailed block diagram of aspects of a data processing 
system that may be used in embodiments of the present invention. 

Figure 4 is an illustration of the display of a set of data of Figure 1 
utilizing a tree map according to embodiments of the present invention. 
25 Figure 5 is a flowchart illustrating operations according to embodiments of 

the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention now will be described more fully hereinafter with 
reference to the accompanying drawings, in which illustrative embodiments of the 
30 invention are shown. This invention may, however, be embodied in many different 
forms and should not be construed as limited to the embodiments set forth herein; 
rather, these embodiments are provided so that this disclosure will be thorough and 
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complete, and will fully convey the scope of the invention to those skilled in the 
art. Like numbers refer to like elements throughout. 

As will be appreciated by one of skill in the art, the present invention may 
be embodied as a method, data processing system, or computer program product. 
5 Accordingly, the present invention may take the form of an entirely hardware 
embodiment, an entirely software embodiment or an embodiment combining 
software and hardware aspects all generally referred to herein as a "circuit" or 
"module." Furthermore, the present invention may take the form of a computer 
program product on a computer-usable storage medium having computer-usable 

10 program code embodied in the medium. Any suitable computer readable medium 
may be utilized including hard disks, CD-ROMs, optical storage devices, a 
transmission media such as those supporting the Internet or an intranet, or 
magnetic storage devices. 

Computer program code for carrying out operations of the present 

1 5 invention may be written in an object oriented programming language such as 

Java®, Smalltalk or C++. However, the computer program code for carrying out 
operations of the present invention may also be written in conventional procedural 
programming languages, such as the "C" programming language. The program 
code may execute entirely on the user's computer, partly on the user's computer, as 

20 a stand-alone software package, partly on the user's computer and partly on a 
remote computer or entirely on the remote computer. In the latter scenario, the 
remote computer may be connected to the user's computer through a local area 
network (LAN) or a wide area network (WAN), or the connection may be made to 
an external computer (for example, through the Internet using an Internet Service 

25 Provider). 

The present invention is described in part below with reference to flowchart 
illustrations and/or block diagrams of methods, apparatus (systems) and computer 
program products according to embodiments of the invention. It will be 
understood that each block of the flowchart illustrations and/or block diagrams, 
30 and combinations of blocks in the flowchart illustrations and/or block diagrams, 
can be implemented by computer program instructions. These computer program 
instructions may be provided to a processor of a general purpose computer, special 
purpose computer, or other programmable data processing apparatus to produce a 
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machine, such that the instructions, which execute via the processor of the 
computer or other programmable data processing apparatus, create means for 
implementing the functions/acts specified in the flowchart and/or block diagram 
block or blocks. 

5 These computer program instructions may also be stored in a computer- 

readable memory that can direct a computer or other programmable data 
processing apparatus to function in a particular manner, such that the instructions 
stored in the computer-readable memory produce an article of manufacture 
including instruction means which implement the function/act specified in the 

10 flowchart and/or block diagram block or blocks. 

The computer program instructions may also be loaded onto a computer or 
other programmable data processing apparatus to cause a series of operational 
steps to be performed on the computer or other programmable apparatus to 
produce a computer implemented process such that the instructions which execute 

15 on the computer or other programmable apparatus provide steps for implementing 
the functions/acts specified in the flowchart and/or block diagram block or blocks. 

Embodiments of the present invention provide for selectively displaying 
data in tree map format on a display by filtering the data so as to display less than a 
complete set of data. Such filtering may provide a mechanism so as to reduce the 

20 volume of data displayed based on, for example, a predefined criteria. The 

predefined criteria may be a characteristic of the data itself or may be provided as 
metadata. 

Various embodiments of the present invention will now be described with 
reference to the figures. Figure 2 illustrates an exemplary embodiment of a data 

25 processing system 130 suitable for a server and network traffic associated with the 
replicated server in accordance with embodiments of the present invention. The 
data processing system 130 typically includes input device(s) 132 such as a 
keyboard, pointer, mouse and/or keypad, a display 134, and a memory 136 that 
communicate with a processor 138. The data processing system 130 may further 

30 include a speaker 144, and an I/O data port(s) 146 that also communicate with the 
processor 138. The I/O data ports 146 can be used to transfer information between 
the data processing system 130 and another computer system or a network. These 
components may be conventional components, such as those used in many 
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conventional data processing systems, which may be configured to operate as 
described herein. 

Figure 3 is a block diagram of data processing systems that illustrates 
systems, methods, and computer program products in accordance with 
5 embodiments of the present invention. The processor 138 communicates with the 
memory 136 via an address/data bus 248. The processor 138 can be any 
commercially available or custom microprocessor. The memory 136 is 
representative of the overall hierarchy of memory devices, and may contain the 
software and data used to implement the functionality of the data processing 

10 system 130. The memory 136 can include, but is not limited to, the following 
types of devices: cache, ROM, PROM, EPROM, EEPROM, flash memory, 
SRAM, and DRAM. 

As shown in Figure 3, the memory 136 may include several categories of 
software and data used in the data processing system 130: the operating system 

15 252; the application programs 254; the input/output (I/O) device drivers 258; and 
the data 256, which may include hierarchical data sets. As will be appreciated by 
those of skill in the art, the operating system 252 may be any operating system 
suitable for use with a data processing system, such as OS/2, AIX or System390 
from International Business Machines Corporation, Armonk, NY, Windows95, 

20 Windows98, Windows2000 or WindowsXP from Microsoft Corporation, 

Redmond, WA, Unix or Linux. The I/O device drivers 258 typically include 
software routines accessed through the operating system 252 by the application 
programs 254 to communicate with devices such as the I/O data port(s) 146 and 
certain memory 136 components. The application programs 254 are illustrative of 

25 the programs that implement the various features of the data processing system 130 
and preferably include at least one application that supports operations according 
to embodiments of the present invention. Finally, the data 256 represents the static 
and dynamic data used by the application programs 254, the operating system 252, 
the I/O device drivers 258, and other software programs that may reside in the 

30 memory 136. 

As is further seen in Figure 3, the application programs 254 may include a 
tree map module 260. The tree map module 260 may carry out the operations 
described herein for displaying a tree map diagram of a data set, such as the tree 
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map data 262. While the present invention is illustrated, for example, with 
reference to the tree map module 260 being an application program in Figure 3, as 
will be appreciated by those of skill in the art, other configurations may also be 
utilized. For example, the tree map module 260 may also be incorporated into the 
5 operating system 252, the I/O device drivers 258 or other such logical division of 
the data processing system 130. Thus, the present invention should not be 
construed as limited to the configuration of Figure 3 but encompasses any 
configuration capable of carrying out the operations described herein. 

One difficulty with tree map visualizations is that as the amount of data 
10 increases, for a given size tree map, the size of the average bounding box 

decreases. However, the size of the minimum bounding box is fixed. Thus, if 
there is a large amount of data that has a small size bounding box, this data may 
take up large amount of the tree map display but may provide little information to a 
user. 

15 For example, if a tree map display is a display of transaction times in a 

network and is to display one million transactions, if 500,000 of the transactions 
were at a minimum transaction time, then there would be 500,000 minimum size 
bounding boxes in the tree map. While the individual bounding boxes may be of 
minimum size, the aggregate space of the tree map may be significant. If the size 

20 of the bounding box reflects a relevance or significance of the underlying data, 
then a large amount of the display may be taken up be less relevant data. 
Furthermore, the overall size of the bounding boxes of the more relevant data may 
be decreased because the size of the average bounding box is decreased. 

Also, in generating a tree map visualization, the size of the bounding boxes 

25 is dependent on all the data being displayed. Thus, the processor, typically, 

evaluates all of the data and then generates the tree map based on this evaluation. 
In the present example, the processor would still evaluate the 500,000 elements 
even though these may be of little importance to the user. 

Embodiments of the present invention provide filtering of the data for a tree 

30 map visualization so as to reduce the amount of data displayed in the tree map. For 
example, the data set of Table 1 above may be filtered to remove elements with a 
first data value of less than or equal to two so as to provide a second data set. The 
second data set would only include elements corresponding to bounding boxes 12 
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through 30 of Figure 1. A tree map visualization is then generated from the 
second, reduced data set. An example of the generated tree map is illustrated in 
Figure 4. 

As seen in Figure 4, in comparison to Figure 1 the bounding boxes for 
5 corresponding elements of the data set of Table 1 have increased. In Figure 4, the 
bounding box 412 corresponds to the bounding box 12 of Figure 1, the bounding 
box 414 corresponds to the bounding box 14 of Figure 1, the bounding box 416 
corresponds to the bounding box 16 of Figure 1, the bounding box 418 
corresponds to the bounding box 18 of Figure 1, the bounding box 420 

10 corresponds to the bounding box 20 of Figure 1, the bounding box 422 
corresponds to the bounding box 22 of Figure 1, the bounding box 424 
corresponds to the bounding box 24 of Figure 1, the bounding box 426 
corresponds to the bounding box 26 of Figure 1, the bounding box 428 
corresponds to the bounding box 28 of Figure 1 and the bounding box 430 

15 corresponds to the bounding box 30 of Figure 1. By filtering the data set of Table 
1 to provide a reduced data set for generation of the tree map of Figure 4, the size 
of the bounding boxes have increased 20% over the bounding boxes of Figure 1 
for the corresponding data. Furthermore, the processor utilization in generation of 
the tree map of Figure 4 has been reduced from evaluating 29 data elements to 

20 evaluating 10 data elements. 

Figure 5 is a flowchart illustration that depicts operations for filtering 
content for display on a tree map pursuant to embodiments of the present 
invention. As shown in Figure 5, a first data set of tree map data is obtained 
(block 500). This first data set is filtered based on a criteria so as to provide a 

25 second data set having a reduced amount of data (block 502) over the first data set. 
The tree map is then generated from the reduced amount of data in the second data 
set (block 504). 

The criteria for filtering the data from the first data set may be a predefined 
threshold value. The data may be filtered based on any suitable comparison to the 
30 threshold value, for example, greater than, less than, and/or equal to the threshold 
value. The data may also be filtered based on multiple threshold values to provide 
data falling within a range or ranges of data. The threshold values may be 
statically set, user specified and/or dynamically determined. The dynamic 
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determination of thresholds may be provided based on an evaluation of the data 
values and/or a desired tree map characteristic. For example, the threshold value 
may be dynamically set to provide a data set with the 100 largest data values. 
Alternatively or additionally, the threshold value could be statically or dynamically 
5 set to provide a tree map with a predefined minimum size bounding box. Such a 
minimum bounding box size may be of greater area than the minimum area for a 
bounding box that a display device is capable of displaying. In such a case, a 
minimum value for the data value that determines the size of the bounding box 
could be preestablished or could be dynamically determined based on values in the 
10 data set. 

Furthermore, the data may be filtered based on the values of data that is 
displayed in the tree map, data associated with the data that is displayed in the tree 
map and/or metadata associated with the tree map data set. In the first instance, the 
data may be filtered based on one or more of the data values that are used in 

15 generating the tree map. For instance, in the stock market example, the data may 
be filtered based on a minimum market capitalization and/or a minimum change in 
stock price. Figure 4 is an example of such a filtering process. 

The data set could also be filtered based on additional data associated with 
the data that generates the tree map. For instance, in the stock market example, the 

20 data may be filtered based on stock market activity where only those stocks with a 
trading activity above a threshold value are included in the tree map. 

The data set could also be filtered based on metadata (i.e. data about the 
data) associated with the data set that generates the tree map. For example, the 
data set could be filtered based on a threshold that is set based on an average of 

25 values of the data in the data set. Similarly, the metadata may indicate a priority 

associated with elements of the data set and that priority used to filter the data. For 
instance, in the stock market example, stocks owned by a user and/or tracked by 
the user could be identified in metadata and the filtering based on whether a stock 
was owned, tracked and/or not prioritized. 

30 In further embodiments of the present invention, the data elements filtered 

from the first data set to provide the second data set that is displayed in the tree 
map could provide a third data set and an additional tree map generated based on 
the third data set so as to display the items not displayed in the tree map generated 
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based on the second data set. Thus, a user could toggle between a tree map based 
on the second data set and a tree map based on the third data set. Alternatively, the 
third data set could be displayed in an alternative display format, such as a table or 
conventional tree diagram. 
5 In still further embodiments of the present invention, the user may toggle 

the filtering on and off so as to toggle between a tree map based on all of the data 
elements of the first data set and a tree map that is based on the results of the 
filtering of the first data set. 

The flowcharts and block diagrams of Figures 2, 3 and 5 illustrate the 

10 architecture, functionality, and operation of possible implementations of systems, 
methods and computer program products for selectively controlling tree map 
graphical user interfaces according to various embodiments of the present 
invention. In this regard, each block in the flow charts or block diagrams may 
represent a module, segment, or portion of code, which comprises one or more 

15 executable instructions for implementing the specified logical function(s). It 

should also be noted that, in some alternative implementations, the functions noted 
in the blocks may occur out of the order noted in the figures. For example, two 
blocks shown in succession may, in fact, be executed substantially concurrently, or 
the blocks may sometimes be executed in the reverse order, depending upon the 

20 functionality involved. It will also be understood that each block of the block 
diagrams and/or flowchart illustrations, and combinations of blocks in the block 
diagrams and/or flowchart illustrations, can be implemented by special purpose 
hardware-based systems which perform the specified functions or acts, or 
combinations of special purpose hardware and computer instructions. 

25 In the drawings and specification, there have been disclosed typical 

illustrative embodiments of the invention and, although specific terms are 
employed, they are used in a generic and descriptive sense only and not for 
purposes of limitation, the scope of the invention being set forth in the following 
claims. 

30 



RSW9200300118US1 12 



